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Introduction 


¢ Enhanced Sixth-Generation ¢ X86 Instruction Set Includes 
Architecture MM X™ Instructions 

- Compatible with MMX"™ Technology 

- Runs Windows® 95, Windows 3.x, Windows NT. 
DOS UNIX®, O8/2®, Solaris®, and others 


- MIL-300 and higher 
- 64K4-Way Unified Write-Back Cache 
- 2 level TLB(16 Entry L1, 384 Entry 12) 


- Branch Prediction with a512-entry BIB ¢ Other Features 

- Enhanced Memory Management Unit - Socket 7 Pinout Compatible 

- Scratchpad RAM in Unified Cache - 2.9 V Cor, 3.3 VO 

- Optimized for both 16- and 32-Hit Code - Hexible Core/Bus Clock Ratios (2x, 2.5x, 3x, 3.5x) 
- High Performance 80-Bit FPU - Leverages Existing Socket Infrastructure 


The Cyrix MII” processor isan enhanced processor The MII processor achieves top performance through 
with high speed performance. This processor has a the use of two optimized superpipelined integer 

64K unified write-back cache, atwo- level TLBanda —_ units, an on-chip floating point unit, and a 64 KByte 
512-entry BTR The MII CPU contains ascratchpad — unified write-back cache. The superpipelined archi- 
RAM feature, supports performance monitoring, and _ tecture reduces timing constraints and increase 
allows caching of both SMI code and SMI data. It frequency scalability Advanced architectural 


delivers high 16- and 32-bit performance while techniques include register renaming, out-of-order 
running Windows 95, Windows NT, OS/2, DOS completion, data dependency removal, branch 
UNIX, and other operating systems prediction and speculative execution. 
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ie ARCHITECTURE 
OVERVIEW 


The Cyrix M Il processor operates at higher 
frequencies than the 6x86MX™ processors 
The MII processor, based on the proven 6x86 
core, is superscalar in that it contains two 
separate pipelines that allow multiple 
instructions to be processed at the same time. 
The use of advanced processing technology 
and superpipelining (increased number of 
pipeline stages) allow the M II CPU to achieve 
high clocks rates. 


Through the use of unique architectural 
features, the M II processor eliminates many 
data dependencies and resource conflicts, 
resulting in optimal performance for both 
16-bit and 32-bit x86 software. 


For maximum performance, the M IIT CPU 
contains two caches, a large unified 64 KByte 
4-way set associative write-back cache and a 
small high-speed instruction line cache. 


To provide support for multimedia operations, 
the cache can be tumed into a scratchpad RAM 
memory on a line by line basis. The cache area 
set aside as scratchpad memory acts as a 
private memory for the CPU and does not 
participate in cache operations. 


Product Overview 


MII" PROCESSOR 
Enhanced High Performance CPU 


Within the M II processor there are two TLBs, 
the main Li TLB and the larger 12 TLB The 
direct-mapped Li TLB has 16 entries and the 
6-way associative L2 Tl Bhas 384 entries. 


The on-chip FPU has been enhanced to 
process MMX instructions as well as the 
floating point instructions. Both types of 
instructions execute in parallel with integer 
instruction processing. To facilitate FPU opera- 
tions, the FPU features a 64-bit data interface, 
afour-deep instruction queue and a six-deep 
store queue. 


The CPU operates using a split rail power 
design. The core runs on a 2.9 volt power 
supply, to minimize power consumption. 
External signal level compatibility is main- 
tained by using a 3.3 volt power supply for the 
VO interface. 


For mobile systems and other power sensitive 
applications, the M II processor incorporates 
low power suspend mode, stop clock capa 
bility, and system management mode (SMM). 
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1.1 Major Functional 
Blocks 


The M II processor consists of four major func- 
tional blocks, as shown in the overall block 
diagram on the first page of this manual: 


¢ Memory Management Unit 
¢ CPU Core 

¢ Cache Unit 

¢ Bus Interface Unit 


The CPU contains the superpipelined integer 
unit, the BIB (Branch Target Buffer) unit and 
the FPU (Hoating Point Unit). 


The BIU (Bus Interface Unit) provides the 
interface between the external system board 
and the processor's internal execution units. 
During a memory cycle, a memory location is 
selected through the address lines (A31-A3 
and BE7# -BED#). Datais passed from or to 
memory through the data lines (D63-D0). 


Fach instruction is read into 256- Byte Instruc- 
tion Line Cache. The Cache Unit stores the 
most recently used data and instructions to 
allow fast access to the information by the 
Integer Unit and FPU. 


The CPU core requests instructions from the 
Cache Unit. The received integer instructions 
are decoded by either the X or Y processing 
pipelines within the superpipelined integer 
unit. If the instruction is a MMX or FPU 
instruction it is passed to the floating point 
unit for processing. 


As required data is fetched from the 64-KByte 
unified cache. If the data is not in the cache it 
is accessed via the bus interface unit from main 
memory. 


The Memory Management Unit calculates 
physical addresses including addresses based 
on paging. 

Physical addresses are calculated by the 


Memory Management Unit and passed to the 
Cache Unit and the Bus Interface Unit (BIU). 
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142 Integer Unit ¢ Instruction Decode 2 (ID2) 
; : e Address Calculation 1 (AC1) 
The Integer Unit (Figure 1-1) provides © Address Calculation 2 (AC2) 
parallel instruction execution using two © Execute (EX) 
seven- stage integer pipelines. Each of the © Write Back (WB) 
two pipelines, X and Y, can process several 
instructions simultaneously. The instruction decode and address calcula- 
The Integer Unit consists of the following tion functions are both divided into superpipe- 
pipeline stages: lined stages. 


¢ Instruction Fetch (IF) 
¢ Instruction Decode 1 (1D1) 


Instruction Fetch 
Instruction Decode 1 
In-Order Instruction Instruction 
Processing Decode 2 Decode 2 


Address Address 
Calculation 1 Calculation 1 


Address Address 
Calculation 2 Calculation 2 
Execution Execution 
Write-Back Write-Back 


X Pipeline Y Pipeline 


Out-of-Order 
Completion 


1727301 


Figure 1-1. Integer Unit 
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1.2.1 Pipeline Stages 


The Instruction Fetch (IF) stage, shared by 
both the X and Y pipelines, fetches 16 bytes of 
code from the cache unit in a single clock 
cycle. Within this section, the code stream is 
checked for any branch instructions that could 
affect normal program sequencing. 


If an unconditional or conditional branch is 
detected, branch prediction logic within the IF 
stage generates a predicted target address for 
the instruction. The IF stage then begins 
fetching instructions at the predicted address. 


The superpipelined Instruction D ecode 
function contains the ID1 and ID2 stages. 
ID1, shared by both pipelines, evaluates the 
code stream provided by the IF stage and 
determines the number of bytes in each 
instruction. Up to two instructions per clock 
are delivered to the ID2 stages, one in each 
pipeline. 

The ID2 stages decode instructions and send 
the decoded instructions to either the X or Y 
pipeline for execution. The particular pipeline 
is chosen, based on which instructions are 
already in each pipeline and how fast they are 
expected to flow through the remaining pipe- 
line stages. 


The Address C alculation function contains 
two stages, AC1 and AC2. If the instruction 
refers to amemory operand, the AC1 calcu- 
lates a linear memory address for the instruc- 
tion. 


The AC2 stage performs any required memory 
management functions, cache accesses, and 
register file accesses. If a floating point instruc- 
tion is detected by AC2, the instruction is sent 
to the FPU for processing. 


The Execute (EX) stage executes instructions 
using the operands provided by the address 
calculation stage. 


The W xrite-Back (WB) stage is the last IU 
stage. The WBstage stores execution results 
either to a register file within the IU or to a 
write buffer in the cache contro! unit. 


1.2.2 Out-of-Order 
Processing 


If an instruction executes faster than the 
previous instruction in the other pipeline, the 
instructions may complete out of order. All 
instructions are processed in order, up to the 
EX stage. While in the EX and WBstages, 
instructions may be completed out of order. 


If there is a data dependency between two 
instructions, the necessary hardware interlocks 
are enforced to ensure correct program 
execution. Even though instructions may 
complete out of order, exceptions and writes 
resulting from the instructions are always 
issued in program order. 
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1.2.3 Pipeline Selection 


In most cases, instructions are processed in 
either pipeline and without pairing constraints 
on the instructions. However, certain instruc- 
tions are processed only in the X pipeline: 


© Branch instructions 
e Floating point instructions 
e Exclusive instructions 


Branch and floating point instructions may be 
paired with a second instruction in the Y pipe- 
line. 


Exclusive Instructions cannot be paired with 
instructions in the Y pipeline. These instruc- 
tions typically require multiple memory 
accesses. Although exclusive instructions may 
not be paired, hardware from both pipelines is 
used to accelerate instruction completion. 
Listed below are the M II CPU exclusive 
instruction types: 


¢ Protected mode segment loads 
Special register accesses 
(Control, Debug, and Test Registers) 
String instructions 
Multiply and divide 
V/O port accesses 
Push all (PUSHA) and pop all (POPA) 
Intersegment jumps, calls, and returns 


1.2.4 Data Dependency 
Solutions 


When two instructions that are executing in 
parallel require access to the same data or 
register, one of the following types of data 
dependencies may occur. 


© Read-After Write (RAW) 
© Write After-Read (WAR) 
¢ Write-After-Wnite (WAW) 


Data dependencies typically force serialized 
execution of instructions. However, the M II 
CPU implements three mechanisms that allow 
parallel execution of instructions containing 
data dependencies: 


¢ Register Renaming 
¢ Data Forwarding 
e Data Bypassing 


The following sections provide detailed exam- 
ples of these mechanisms. 


1.2.4.1 Register Renaming 


The M II CPU contains 32 physical general 
purpose registers. Each of the 32 registers in 
the register file can be temporarily assigned as 
one of the general purpose registers defined by 
the x86 architecture (EAX, ERX, HCX, EDX, 
ESI, EDI, EBP and ESP). For each register 
write operation a new physical register is 
selected to allow previous data to be retained 
temporarily. Register renaming effectively 
removes all WAW and WAR dependencies. 
The programmer does not have to consider 
register renaming as register renaming is 
completely transparent to both the operating 
system and application software. 
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Exam ple #1 —RegisterRenam ing Elm inates W riteA fterRead WAR) Dependency 


A WAR dependency exists when the first in a pair of instructions reads a logical register, and the 
second instruction writes to the same logical register. This type of dependency is illustrated by the 
pair of instructions shown below: 


X PIPE Y PIPE 
(1) MOV BX, AX — (2) ADD AX, CX 
BX< AX AX — AX+CX 


Note: In this and the following examples the original instruction order is shown in parentheses. 


In the absence of register renaming, the ADD instruction in the Y pipe would have to be stalled to 
allow the MOV instruction in the X pipe to read the AX register. 


The M II CPU, however, avoids the Y pipe stall (Table 1-2). As each instruction executes, the 
results are placed in new physical registers to avoid the possibility of overwriting a logical register 
value and to allow the two instructions to complete in parallel (or out of order) rather than in 
sequence. 


Table 1-1. Register Renaming with WAR Dependency 


i 
Gita) fax fom pce 


comma wee | | am 
pacer [Le Pm [a | ¥ [cman 


Note: The representation of the MOV and ADD instructions in the final column of Table 1-2 
are completely independent. 
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Exam ple #2 —RegisterRenam ing Elm inates W riteAfterw rite (W AW ) Dependency 


A WAW dependency occurs when two consecutive instructions perform writes to the same logical 
register. This type of dependency is illustrated by the pair of instructions shown below: 


X PIPE Y PIPE 
(1) ADD AX, BX ~— (2) MOV AX, [mem] 
AX AX + BX AX < [mem] 


Without register renaming, the MOV instruction in the Y pipe would have to be stalled to guar- 
antee that the ADD instruction in the X pipe would write its results to the AX register first. 


The M II CPU uses register renaming and avoids the Y pipe stall. The contents of the AX and BX 
registers are placed in physical registers (‘Table 1-3). As each instruction executes, the results are 
placed in new physical registers to avoid the possibility of overwriting a logical register value and 
to allow the two instructions to complete in parallel (or out of order) rather than in sequence. 


Table 1-2. Register Renaming with WAW Dependency 


Physical Register Contents 


a ae 
Ee a ee ee 
ee os eS ee ae ee ees 
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woven | [mL | Y [Recto 


Note: All subsequent reads of the logical register AX will refer to Reg 3, the result of the MOV 
instruction. 
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1.2.4.2 Data Forwarding 


Register renaming alone cannot remove RAW 
dependencies. The M II CPU uses two types of 
data forwarding in conjunction with register 
renaming to eliminate RAW dependencies: 


¢ Operand Forwarding 
e Result Forwarding 


O perand foxrw arcing takes place when the 
first in a pair of instructions performs a move 
from register or memory, and the data that is 
read by the first instruction is required by the 
second instruction. The M II CPU performs 
the read operation and makes the data read 
available to both instructions simultaneously. 


Result forw arding takes place when the first 
in a pair of instructions performs an operation 
(such as an ADD) and the result is required by 
the second instruction to perform amove to a 
register or memory. The M II CPU performs 
the required operation and stores the results of 
the operation to the destination of both 
instructions simultaneously. 


1-8 PRELIMINARY 


| 


Exam ple #3 —O perand Forw arding Elim inates Read fterwW rite (RAW ) Dependency 


A RAW dependency occurs when the first in a pair of instructions performs a write, and the 
second instruction reads the same register. This type of dependency is illustrated by the pair of 
instructions shown below in the X and Y pipelines: 


xX PIPE Y PIPE 
(1) MOV AX, [mem] (2) ADD BX, AX 
AX < [mem] BX< AX + BX 


The M II CPU uses operand forwarding and avoids a Y pipe stall (Table 1-4). Operand 
forwarding allows simultaneous execution of both instructions by first reading memory and then 
making the results available to both pipelines in parallel. 


Table 1-3. Example of Operand Forwarding 


Physical Register Contents 


ae ae 
i 
eee 2S eee ee eee 


SS ES eS eee 
ee ee eae 


Operand forwarding can only occur if the first instruction does not modify its source data. In 
other words, the instruction is amove type instruction (for example, MOV, POP LEA). Operand 
forwarding occurs for both register and memory operands. The size of the first instruction desti- 
nation and the second instruction source must match. 
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Exam ple #4 -—ResultForw arding Elim inates Read fterw rite (RAW ) Dependency 


In this example, a RAW dependency occurs when the first in a pair of instructions performs a 
write, and the second instruction reads the same register. This dependency is illustrated by the 
pair of instructions in the X and Y pipelines, as shown below: 


xX PIPE Y PIPE 


(1) ADD AX, BX —- (2) MOV[mem], AX 


AX —AX+ BX [mem] + AX 


The M II CPU uses result forwarding and avoids a Y pipe stall (‘Table 1-5). Instead of transferring 
the contents of the AX register to memory, the result of the previous ADD instruction (Reg0 + 
Reg1) is written directly to memory, thereby saving a clock cycle. 


Table 1-4. Result Forwarding Example 


Physical Register 
Contents 


reso [nest eee? [Pre 


jana) fae | me TT 
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worn [ [mA | ¥ | Gn Rg et 


The second instruction must be a move instruction and the destination of the second instruction 
may be either a register or memory. 
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1.2.4.3 Data Bypassing 


In addition to register renaming and data forwarding, the M II CPU implements a third data 
dependency-resolution technique called data bypassing. Data bypassing reduces the performance 
penalty of those memory data RAW dependencies that cannot be eliminated by data forwarding. 


Data bypassing is implemented when the first in a pair of instructions writes to memory and the 

second instruction reads the same data from memory. The M II CPU retains the data from the first 
instruction and passes it to the second instruction, thereby eliminating amemory read cycle. Data 
bypassing only occurs for cacheable memory locations. 


Exam ple #1-D ata Bypassing w ith Read~A fterw rite RAW ) Dependency 


In this example, a RAW dependency occurs when the first in a pair of instructions performs a 
write to memory and the second instruction reads the same memory location. This dependency is 
illustrated by the pair of instructions in the X and Y pipelines as shown below: 


X PIPE Y PIPE 
(1) ADD [mem], AX (2) SUBEX, [mem] 
[mem] <[mem] + AX BX < BX - [mem] 


The M II CPU uses data bypassing and stalls the Y pipe for only one clock by eliminating the Y 
pipe’s memory read cycle (Table 1-6). Instead of reading memory in the Y pipe, the result of the 
previous instruction ([mem] + RegQ) is used to subtract from Reg1, thereby saving amemory 
access cycle. 


Table 1-5. Example of Data Bypassing 


Physical Register 


Contents 
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1.2.5 Branch Control 


Branch instructions occur on average every 
four to six instructions in x86-compatible pro- 
grams. When the normal sequential flow of a 
program changes due to abranch instruction, 
the pipeline stages may stall while waiting for 
the CPU to calculate, retrieve, and decode the 
new instruction stream. The M IT CPU mini- 
mizes the performance degradation and 
latency of branch instructions through the use 
of branch prediction and speculative execu- 
tion. 
1.2.5.1 Branch Prediction 

The M II CPU uses a 512-entry, 4-way set asso- 
ciative Branch Target Buffer (BIB) to store 
branch target addresses. The M II CPU has 
1024-entry branch history table. During the 
fetch stage, the instruction stream is checked 
for the presence of branch instructions. If an 
unconditional branch instruction is encoun- 
tered, the M II CPU accesses the BIB to check 
for the branch instruction’s target address. If 
the branch instruction’s target address is found 
in the BIB, the M II CPU begins fetching at the 
target address specified by the BIB 


In case of conditional branches, the BIB also 
provides history information to indicate 
whether the branch is more likely to be taken 
or not taken. If the conditional branch instruc- 
tion is found in the BIB, the M II CPU begins 
fetching instructions at the predicted target 
address. If the conditional branch misses in the 
BIB, the M II CPU predicts that the branch 
will not be taken, and instruction fetching 
continues with the next sequential instruction. 


The decision to fetch the taken or not taken 
target address is based on a four-state branch 
prediction algorithm. 


Once fetched, a conditional branch instruction 
is first decoded and then dispatched to the X 
pipeline only. The conditional branch instruc- 
tion proceeds through the X pipeline and is 
then resolved in either the EX stage or the WB 
stage. The conditional branch is resolved in the 
EX stage, if the instruction responsible for 
setting the condition codes is completed prior 
to the execution of the branch. If the instruc- 
tion that sets the condition codes is executed 
in parallel with the branch, the conditional 
branch instruction is resolved in the WB stage. 


Correctly predicted branch instructions 
execute in asingle core clock. If resolution of a 
branch indicates that a misprediction has 
occurred, the M II CPU flushes the pipeline 
and starts fetching from the correct target 
address. The M II CPU prefetches both the 
predicted and the non-predicted path for each 
conditional branch, thereby eliminating the 
cache access cycle on a misprediction. If the 
branch is resolved in the EX stage, the 
resulting misprediction latency is four cycles. 
If the branch is resolved in the WB stage, the 
latency is five cycles. 


Since the target address of return (RET) 
instructions is dynamic rather than static, the 
MII CPU caches target addresses for RET 
instructions in an eight-entry retum stack 
rather than in the BIB. The return address is 
pushed on the return stack during a CALL 
instruction and popped during the corre- 
sponding RET instruction. 
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1.2.5.2 Speculative Execution 


The M II CPU is capable of speculative execu- 
tion following a floating point instruction or 
predicted branch. Speculative execution allows 
the pipelines to continuously execute instruc- 
tions following a branch without stalling the 
pipelines waiting for branch resolution. The 
same mechanism is used to execute floating 
point instructions (see Section 1.6) in parallel 
with integer instructions. 


The M II CPU is capable of up to four levels of 
speculation (i.e., combinations of four condi- 
tional branches and floating point opera- 
tions). After generating the fetch address using 
branch prediction, the CPU checkpoints the 
machine state (registers, flags, and processor 
environment), increments the speculation level 
counter, and begins operating on the predicted 
instruction stream. 


Once the branch instruction is resolved, the 
CPU decreases the speculation level. Fora 
correctly predicted branch, the status of the 
checkpointed resources is cleared. For a 
branch misprediction, the M II processor 
generates the correct fetch address and uses 
the checkpointed values to restore the machine 
state in a single clock. 


Tn order to maintain compatibility writes that 
result from speculatively executed instructions 
are not permitted to update the cache or 
external memory until the appropriate branch 
is resolved. Speculative execution continues 
until one of the following conditions occurs: 


1) Abranch or floating point operation 
is decoded and the speculation level 
is already at four. 


2) An exception or a fault occurs. 
3) The write buffers are full. 


4) An attempt is made to modify a 
non-checkpointed resource (i.e., 


segment registers, system flags). 
1.3 Cache Units 


The M II CPU employs two caches, the Unified 
Cache and the Instruction Line Cache (Figure 
1-2, Page 1-15). The main cache is a 4-way 
set- associative 64-KByte unified cache. The 
unified cache provides a higher hit rate than 
using equal-sized separate data and instruction 
caches. While in Cyrix SMM mode both SMM 
code and data are cacheable. 


The instruction line cache is a fully associative 
256-byte cache. This cache avoids excessive 
conflicts between code and data accesses in the 
unified cache. 


1.3.1 Unified Cache 


The 64-KByte unified write-back cache func- 
tions as the primary data cache and as the 
secondary instruction cache. Configured as a 
four-way set-associative cache, the cache stores 
up to 64 KBytes of code and data in 2048 
lines. The cache is dual-ported and allows any 
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two of the following operations to occur in 
parallel: 


© Code fetch 
e Dataread (X pipe, Y pipeline or FPU) 
¢ Data write (X pipe, Y pipeline or FPU) 


The unified cache uses a pseudo-LRU replace- 
ment algorithm and can be configured to allo- 
cate new lines on read misses only or on read 
and write misses. 


Instruction Data 


1.3.2 Instruction Line Cache 


The fully associative 256-byte instruction line 
cache serves as the primary instruction cache. 
The instruction line cache is filled from the 
unified cache through the data bus. Fetches 
from the integer unit that hit in the instruction 
line cache do not access the unified cache. If 
an instruction line cache miss occurs, the 
instruction line data from the unified cache is 
transferred to the instruction line cache and 
the integer unit, simultaneously. 


Integer Instruction 


Data Bus 


Unit Add Se Data 
Instruction Line Cache Bucs 
256- Byte Fully Associative, 8 Lines Fae 


Unified Cache 
64-KByte, 4-Way Set Associative, 2048 Lines 


Instruction 
Line Cache 
Miss Address 


Modified X, Y 
Physical Addresses 


Memory Management Unit 


1747900 


Figure 1-2. Cache Unit Operations 


=== = Dual Bus 
—— = Single Bus 
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The instruction line cache uses a pseudo-LRU 
replacement algorithm. To ensure proper oper- 
ation in the case of self-modifying code, any 
writes to the unified cache are checked against 
the contents of the instruction line cache. Ifa 
hit occurs in the instruction line cache, the 
appropriate line is invalidated. 

1.4 Memory 
Management Unit 


The Memory Management Unit (MMU), 
shown in Figure 1-3, translates the linear 
address supplied by the IU into a physical 
address to be used by the unified cache and 
the bus interface. Memory management proce- 


Linear 
Address 


dures are x86 compatible, adhering to stan- 
dard paging mechanisms. 


Within the M II CPU there are two TLBs, the 
main L1 TLBand the larger L2 TLB The 
16-entry L1 TLB is direct mapped and holds 
42 lines. The 384-entry L2 TLBis 6-way 
associative and hold 384 lines. The DTEis 
located in memory. 


Scratch Pad CacheM em ory 


The M II CPU has the capability to “lock 
down” lines in the Li cache on a line by line 
basis. Locked down lines are treated as private 
memory for use by the CPU. Locked down 
memory does not participate in hardware-- 
cache coherency protocols. 


Main L1 TLB 


Control Register 


Figure 1-3. 
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Paging Mechanism within the Memory 


Management Unit 
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Cache locking is controlled through use of the 
RDMSR and WRMS&R instructions. 


125 Floating Point Unit 


The M II Moating Point Unit (FPU) processes 
floating point and MMX instructions. The 
FPU interfaces to the integer unit and the 
cache unit through a 64-bit bus. The M II FPU 
is x87 instruction set compatible and adheres 
to the IKEE-754 standard. Since most applica- 
tions contain FPU instructions mixed with 
integer instructions, the M II FPU achieves 
high performance by completing integer and 
FPU operations in parallel. 


FPU ParallelExecution 


The M II CPU executes integer instructions in 
parallel with FPU instructions. Integer instruc- 
tions may complete out of order with respect 
to the FPU instructions. The M II CPU main- 
tains x86 compatibility by signaling exceptions 
and issuing write cycles in program order. 


As previously discussed, FPU instructions are 
always dispatched to the integer unit’s X pipe- 
line. The address calculation stage of the X 
pipeline checks for memory management 
exceptions and accesses memory operands 
used by the FPU. If no exceptions are detected, 
the M II CPU checkpoints the state of the CPU 
and, during AC2, dispatches the floating point 
instruction to the FPU instruction queue. The 
M II CPU can then complete any subsequent 
integer instructions speculatively and out of 
order relative to the FPU instruction and rela- 


tive to any potential FPU exceptions which 
may occur. 


As additional FPU instructions enter the pipe- 
line, the M II CPU dispatches up to four FPU 
instructions to the FPU instruction queue. The 
MII CPU continues executing speculatively 
and out of order, relative to the FPU queue, 
until the M II CPU encounters one of the 
conditions that causes speculative execution to 
halt. As the FPU completes instructions, the 
speculation level decreases and the check- 
pointed resources are available for reuse in 
subsequent operations. The M II FPU also uses 
aset of six write buffers to prevent stalls due to 
speculative writes. 


1.6 Bus Interface Unit 


The Bus Interface Unit (BIU) provides the 
signals and timing required by external 
circuitry. The signal descriptions and bus inter- 
face timing information is provided in 
Chapters 3 and 4 of this manual. 
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2.4.4 M II Configuration 


Registers 


The MII configuration registers are used to 
enable features in the M II CPU. These registers 
assign non-cached memory areas, set up SMM, 
provide CPU identification information and 
control various features such as cache write 
policy, and bus locking control. There are four 
groups of registers within the M II configura- 
tion register set: 


¢ 7 Configuration Control Registers (CCRx) 
e 8 Address Region Registers (ARRx) 
¢ 8 Region Control Registers (RCRx) 


Access to the configuration registers is achieved 
by writing the register index number for the 
configuration register to I/O port 22h. I/O port 
23h is then used for data transfer. 


Fach J/O port 23h data transfer must be pre- 
ceded by a valid /O port 22h register index 
selection. Otherwise, the current 22h, and the 
second and later /O port 23h operations com- 
municate through the I/O port to produce 
external I/O cycles. All reads from I/O port 22h 
produce external I/O cycles. Accesses that hit 
within the on-chip configuration registers do 
not generate external I/O cycles. 


After reset, configuration registers with indexes 
CO-CFh and FC-FFh are accessible. To prevent 
potential conflicts with other devices which 
may use ports 22 and 23h to access their regis- 
ters, the remaining registers (indexes DO-FBh) 
are accessible only if the MAPEN(3-0) bits in 
CCR3 are set to 1h. See Figure 2-16 (Page 
2-29) for more information on the 
MAPEN(3-0) bit locations. 


If MAPEN[3-0] = 1h, any access to indexes in 
the range 00-FFh will not create external I/O 
bus cycles. Registers with indexes CO-Ckh, 
FC- F¥h are accessible regardless of the state of 
MAPEN[3-0]. If the register index number is 
outside the CO-CFh or FC- FFh ranges, and 
MAPEN|3-0] are set to Oh, external I/O bus 
cycles occur. Table 2-11 (Page 2-25) lists the 
MAPEN|[3-0] values required to access each M 
Il configuration register. All bits in the config- 
uration registers are initialized to zero following 
reset unless specified otherwise. 


2.4.4.1 Configuration Control 


Registers 


(CCRO - CCR6) control several functions, 
including non-cacheable memory, write-back 
regions, and SMM features. A list of the config- 
uration registers is listed in Table 2-11 (Page 
2-25). The configuration registers are described 
in greater detail in the following pages. 
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Table 2-11. M II CPU Configuration Registers 


MAPEN VALUE 
REGISTER NAME ACRONYM rete ‘elie NEEDED FOR 
ACCESS 
Goninon Cantal [oR | Ow | 8 


Saipan! ox | aa [6 | 
Goninon Gomi Jooe ea [8 


wo) OM 


= 


ssi fa onan [oe 
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Figure 2-13. M II Configuration Control Register 0 (CCRO) 


Table 2-12. CCRO Bit Definitions 


BIT 
POSITION | NAME DESCRIPTION 


No Cache 640 KByte - 1 MByte 


If= 1: Address region 640 KByte to 1 MByte is non-cacheable. 
If= 0: Address region 640 KByte to 1 MByte is cacheable. 


Note: Bits 0, 2 through 7 are reserved. 
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Figure 2-14. M II Configuration Control Register 1 (CCR1) 


Table 2-13. CCR1 Bit Definitions 


BIT 
[posimion | NAME | i escntprion 
7 SM3 SMM Address Space Address Region 3 
If= 1: Address Region 3 is designated as SMM address space. 
If= 1: All bus cycles are issued with LOCK# pin negated except page table accesses and 
interrupt acknowledge cycles Interrupt acknowledge cycles are executed as locked 
cycles even though LOCK# is negated. With NO_LOCK set, previously noncacheable 


locked cycles are executed as unlocked cycles and therefore, may be cached. This 
results in higher performance. Refer to Region Control Registers for information on 


eliminating locked CPU bus cycles only in specific address regions. 


If= 1: Any access to addresses within the SMM address space, access system manage- 
ment memory instead of main memory. SMI# input is ignored. Used when initializing 
or testing SMM memory. 
If= 0: No effect on access. 
1 USE_SMI_ | Enable SMM and SMIACT# Fins 

Tf= 1: SMI# and SMIACT# pins are enabled. 
If= 0: SMI# pin ignored and SMIACT# pin is driven inactive. 

Note: Bits 0, 3, 5 and 6 are reserved. 
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Figure 2-15. M Il Configuration Control Register 2 (CCR2) 


Table 2-14. CCR2 Bit Definitions 


BIT 


7 USE_SUSP Use Suspend Mode (Enable Suspend Pins) 
If= 1: SUSP# and SUSPA# pins are enabled. 
If = 0: SUSP# pin is ignored and SUSPA# pin floats. 
WPR1 Write-Protect Region 1 
If = 1: Designates any cacheable accesses in 640 KByte to 1 MByte address region 
are write protected. 


3 SUSP_HIT Suspend on Halt 
If = 1: Execution of the HIT instruction causes the CPU to enter low power sus- 
pend mode. 


2 LOCK_NW Lock NW 
If = 1: NW bit in CRO becomes read only and the CPU ignores any writes to the 
NW bit. 
If = 0: NW bit in CRO can be modified. 
If = 1: CPU inserts an idle cycle following sampling of BRDY# and inserts an idle 
cycle prior to asserting ADS# 


Note: Bits 0, 5 and 6 are reserved. 
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Figure 2-16. M II Configuration Control Register 3 (CCR3) 


Table 2-15. CCR3 Bit Definitions 


BIT 


7-4 MAPEN(3-0) MAP Fnable 
If = 1h: All configuration registers are accessible. 
If =0h: Only configuration registers with indexes CO-CFh, FEh and FFh 
are accessible. 
If = 1: Use linear address sequence during burst cycles. 
If =0: Use “1 + 4” address sequence during burst cycles. The “1 + 4” address 
sequence is compatible with Pentium’s burst address sequence. 
NMI Fnable 
If =1: NM interrupt is recognized while servicing an SMI interrupt. 
NML_EN should be set only while in SVM, after the appropriate SMI interrupt 
service routine has been setup. 
SMI Lock 
If = 1: The following SMM configuration bits can only be modified while in an 
SMI service routine: 
CCR1: USE_SMI, SMAC, SM3 
CCR3: NMI_EN 
CCR6: N, SMM_MODE 
ARR3: Starting address and block size. 
Once set, the features locked by SMI_LOCK cannot be unlocked until the 
RESET pin is asserted. 


Note: Bit 3 is reserved. 
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Figure 2-17. M II Configuration Control Register 4 (CCR4) 


Table 2-16. CCR4 Bit Definitions 


BIT 


z | CPUID Enable CPUID instruction. 
If = 1: the ID bit in the EFLAGS register can be modified and execution of the 
CPUID instruction occurs as documented in section 6.3. 
If =0: the ID bit in the EFLAGS register can not be modified and execution of 
the CPUID instruction causes an invalid opcode exception. 


2-0 YO Recovery Time 
Specifies the minimum number of bus clocks between I/O accesses: 
Oh = 1 clock delay 
1h = 2 clock delay 
2h = 4 clock delay 
3h = 8 clock delay 
4h = 16 clock delay 
5h = 82 clock delay (default value after RESET) 
6h = 64 clock delay 
7h = no delay 


Note: Bits 3 - 6 are reserved. 
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Figure 2-18. M II Configuration Control Register 5 (CCR5) 


Table 2-17. CCR5 Bit Definitions 


BIT 


Enable ARR Registers 
If = 1: Enables all ARR registers. 
If =0: Disables the ARRregisters. If SMB is set, ARR3 is enabled regardless of 


the setting of ARREN. 


Write-Through Allocate 
If = 1: New cache lines are allocated for read and write misses. 
If =0: New cache lines are allocated only for read misses. 


Note: Bits 1 - 3 and6 - 7 are reserved. 
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Figure 2-19. M II Configuration Control Register 6 (CCR6) 


Table 2-18. CCR6 Bit Definitions 


BIT 


N Nested SMI Fhable bit: If operating in Cyrix enhanced SMM mode and: 
If = 1: Enables nesting of SMPs 
If = 0: Disable nesting of SMPs. 
This bit is automatically CLEARED upon entry to every SMM routine and is 
SET upon every RSM. Therefore enabling/disabling of nested SMI can only be 
done while operating in SMM mode. 


WP_ARR3 If =1: Memory region defined by ARR3 is write protected when operating out- 
side of SMM mode. 
If =0: Disable write protection for memory region defined by ARR3. 
Reset State = 0. 


ficaaeshaeneel MODE | If=1: Enables Cyrix Enhanced SMM mode. 
ficaeesuaamaele 0: Disables Cyrix Enhanced SMM mode. 


Note: Bit 1 is reserved. 
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2.4.4.2 Memory Address 
Region Registers 


The Address Region Registers (ARRO - ARR7) 
are paired with the Region Control Registers 
(RCRO-RCR7) and specify up to eight 
memory address regions. Using ARR/RCR 
pairs, these regions can be designated as 
non-cacheable, write through, write locking 
and weak locking. Register pairs ARR7/ RCR7 
are unique and can define attributes for all of 
system main memory. 


The 24-bit ARR registers (Figure 2-20) are 
divided into the 20-bit BASH ADDRESS and 
4-bit SIZE fields The fields define the size 
and base addresses for the memory regions. 


START ADDRESS 


The base address must be on ablock size 
boundary. For example, if a 128-KByte block is 
used, the base addresses are allow to be 0, 

128 and, 256 KBytes, and so forth. 


The meaning of the SIZE fields are listed in Table 
2-20. (Page 2-34). If the SIZE field is zero, the 
address region is zero sized and thus disabled. 


The ARR registers are accessed using I/O ports 
22h and 23h. To read or write to acomplete ARR 
register, three I/O port cycles are required. Each 
byte is assigned an index value (Table 2-19, Page 
2-34). 


A region is noncacheable, if defined by ARR/RCR 
pair even if KEN# is active. 


Memory Address Memory Address Memory Address —- Bits 
Bits A31-A24 Bits A23-A16 Bits A15-A12 3-0 


Figure 2-20. Address Region Registers (ARRO - ARR7) 
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Table 2-19. ARRO - ARR7 Register Index Assignments 


Register Bytes 
rinay | age Mowery’ || Base Memen: | Saarces reid 
Register Address Field Address Field ie bree 
sid dia Micra SIZE Field 


Table 2-20. SIZE Field Bit Definition 


BLOCK SIZE| BLOCK SIZE BLOCK SIZE | BLOCK SIZE 
SIZE (3-0) SIZE (3-0) 
ARRO -6 ARR7 
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2.4.4.3 Region Control 


Registers 


The Region Control Registers (RCRO - RCR7) 
are paired with the Address Region Registers 
(ARRO -ARR7). Each RCR register specifies the 
attributes associated with a particular address 
regions. These attributes include: cacheability, 
weak locking, write gathering, and cache write 
through policies. The bit definitions for the 
region control registers are shown in Figure 
2-21 (Page 2-36) and in Table 2-21 (Page 
2-36). 


If an addressis accessed thatis notin amemory 
region defined by the ARRx registers, the fol- 
lowing conditions will apply: 


e Ifthe memory address is cached, 
write-back is enabled if WB/WT# is 
returned high. 

e Writes are not gathered 

¢ Stong locking takes place 

e The memory access is cached, if KEN# is 
returned asserted. 


O verlapping C onditions D efined . If two 
regions specified by ARRx registers overlap and 
conflicting attributes are specified, the follow- 
ing attributes take precedence: 


¢ Write-back is disabled 

¢ Writes are not gathered 

¢ Strong locking takes place 

¢ The overlapping regions are 
non-cacheable. 
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Figure 2-21. Region Control Registers (RCRO-RCR7 ) 


Table 2-21. RCRO-RCR7 Bit Definitions 


BIT 
POSITION | NAME | DESCRIPTION 


INV_RGN | Inverted Region. If=1, applies controls specified in RCRx to all memory addresses out- 
side the region specified in corresponding ARR. Applicable to RCRO-RCR6 only. 
4 WT 


W rite-Through . If =1, defines the address region as write-through instead of 
write-back. 


W rite G athering. If =1, enables write gathering for the associated address region. 
W eak Lock ing. If =1, enables weak locking for that address region. 
| 0 = |c@ Cache D isable. If=1, defines the address region as non-cacheable. 


Note: Bits 1, 5 and 7 are reserved. 


2-36 PRELIMINARY 


System Register Set 2 


Inverted Region (NV RGN). Setting 
INV-RGN applies the controls in RCRx to all the 
memory addresses outside the specified address 
region ARRx. This bit effects RCRO-RCR6 and 
not RCR7. 


W rite Through W T).Setting WT defines the 
address region as write-through instead of 
write-back, assuming the region is cacheable. 
Regions where system ROM are loaded (shad- 
owed or not) should be defined as write- 
through. This bit worksin conjunction with the 
CRO_NW and PWT bits and the WB/WT# pin 
to determine write-through or write-back 
cacheability, 


W rite Gathering W G). Setting WG enables 
write gathering for the associated address 
region. Write gathering allows multiple byte, 
word, or Dword sequential address writes to 
accumulate in the on-chip write buffer. As 
instructions are executed, the results are placed 
in a series of output buffers. These buffers are 
gathered into the final output buffer. 


When access is made to a non-sequential mem- 
ory location or when the 8-byte buffer becomes 
full, the contents of the buffer are written on the 
external 64-bit data bus. Performance is 
enhanced by avoiding as many as seven memory 
write cycles. 


WG should not be used on memory regions that 
are sensitive to write cycle gathering. WG.can be 
enabled for both cacheable and non-cacheable 
regions. 
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W eak Locking W L). Setting WLenables 
weak locking for the associated address region. 
During weak locking all bus cycles are issued 
with the LOCK# pin negated (except when page 
table access occur and during interrupt 
acknowledge cycles.) 


Interrupt acknowledge cycles are executed as 
locked cycles even though LOCK+# is negated. 
With WLset previously non-cacheable locked 
cycles are executed as unlocked cycles and 
therefore, may be cached, resulting in higher 
CPU performance. 


Note that the NO_LOCK bit globally performs 
the same function that the WL bit performs on 
asingle address region. The NO_LOCK bit of 
CCRI enables weak locking for the entire 
address space. The WLbit allows weak locking 
only for specific address regions. WL is inde- 
pendent of the cacheability of the address 
region. 


Cache D isable CD ).Cache Disable - If set, 
defines the pe sae as non-cacheable. 
This bit works in conjunction with the CRO_CD 
and PCD bits and the KEN# pin to determine 
line cacheability. Whenever possible, the 
ARR/RCRcombination should be used to define 
non-cacheable regions rather than using exter- 
nal address decoding and driving the KEN# pin 
as the MII can better utilize its advanced tech- 
niques for eliminating data dependencies and 
resource conflicts with non-cacheable regions 
defined on-chip. 
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2.5 Model Specific 


Registers 


The CPU contains several Model Specific 
Registers (MSRs) that provide time stamp, 
performance monitoring and counter event 
functions. Access to a specific MSR through an 
index value in the HCX register as shown in 
Table 2-22 below. 


Table 2-22. Machine Specific 
Register 


DESCRIPTION VALUE 


Time Stamp Counter (TSC) 
Counter Event Selection and Control Register | lh | 
FeomareeComirtO ah 


Performance Counter #1 


The MSR registers can be read using the 
RDMSR instruction, opcode OF32h. During an 
MSR register read, the contents of the particular 
MSR register, specified by the HCX register, is 
loaded into the EDX:FAX registers. 


The MSR registers can be written using the 
WRMS&R instruction, opcode OF380h. During a 
MSR register write the contents of EDX:EAX are 
loaded into the MSR register specified in the 
ECX register. 


The RDMSR and WRMS&R instructions are 
privileged instructions and are also used to 
setup scratch pad lock (Page 2-61). 


2.6 Time Stamp Counter 


The Time Stamp Counter (TSC) Register 
MSK(10) is a 64-bit counter that counts the in- 
ternal CPU clock cycles since the last reset. The 
TSC uses a continous CPU core clock and will 
continue to count clock cycles even when the 
MI is suspend mode or shutdown. 


The TSC can be accessed using the RDMSRand 
WRMS&R instructions. In addition, the TSC can 
be read using the RDTSC instruction, opcode 
OF31h. The RDTSC instruction loads the con- 
tents of the TSC into EDX:EAX. The use of the 
RDTSC instruction is restricted by the Time 
Stamp Disable, (TSD) flag in CR4. When the 
TSD flag is 0, the RDTSC instruction can be ex- 
ecuted at any privilege level. When the TSD 
flag is 1, the RDTSC instruction can only be ex- 
ecuted at privilege level 0. 


2.7 Performance 


Monitoring 


Performance monitoring allows counting of 
over a hundred different event occurrences and 
durations. Two 48-bit counters are used: Per- 
formance Monitor Counter 0 and Performance 
Monitor Counter 1. These two performance 
monitor counters are controlled by the Counter 
Event Control Register MSR(11). The perfor- 
mance monitor counters use acontinuous CPU 
core clock and will continue to count clock cy- 
cles even when the M II CPU is in suspend 
mode or shutdown. 
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2.8 Performance 
Monitoring 


Counters 1 and 2 


The 48-bit Performance Monitoring Counters 
(PMC) Registers MSR(12), MSR(13) count 
events as specified by the counter event control 
register. 


The PMCs can be accessed by the RDMSR and 
WRMB&R instructions. In addition, the PMCs 
can be read by the RDPMC instruction, opcode 
OF83h. The RDPMC instruction loads the con- 
tents of the PMC register specified in the HCX 
register into EDX:EAX. The use of RDPMC in- 
structions is restricted by the Performance Mon- 
itoring Counter Enable, (PCE) flag in C4. 


When the PCE flag is set to 1, the RDPMC in- 
struction can be executed at any privilege level. 
When the PCEflag is 0, the RDPMC instruction 
can only be executed at privilege level 0. 


2.8.1 Counter Event 


Control Register 


Register MSR(11) controls the two internal 
counters, #0 and #1. The events to be counted 
have been chosen based on the micro-architec- 
ture of the M II processor. The contzol register 
for the two event counters is described in 
Figure 2-21 (Page 2-36) and Table 2-23 (Page 
2-40). 
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2.8.1.1 PM Pin Control 


The Counter Event Control register MSR(11) 
contains PM control fields that define the PMO 
and PM1 pins as counter overflow indicators or 
counter event indicators. When defined as 
event counters, the PM pins indicate that one or 
more events occurred during a particular clock 
cycle and do not count the actual events. 


When defined as overflow indicators, the event 
counters can be preset with a value less the 

81 and allowed to increment as events occur. 
When the counter overflows the PM pin be- 
comes asserted. 


2.8.1.2 Counter Type Control 


The Counter Type bit determines whether the 
counter will count clocks or events. When 
counting clocks the counter operates as a timer. 


2.8.1.3 CPL Control 


The Current Privilege Level (CPL) can be used 
to determine if the counters are enabled. The 
CP02 bit in the MSR11) register enables count- 
ing when the CPLis less than three, and the 
CP08 bit enables counting when CPLis equal to 
three. Ifboth bits are set, counting is not depen- 
dent on the CPL level; if neither bitis set, count- 
ing is disabled. 
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222 2 2 

6 5 4 3 2 21 16 15 10 9 8 7 6 5 0 
T; P|} C] C) C P|C|)C|;C 

C| M| T| P| P M| T| P| P 

1} 1); 1); 1) 1 0/0)0/]0 

* 3 | 2 3 | 2 


*Note: Split Fields 


Figure 2-22. Counter Event Control Register 


Table 2-23. Counter Event Control Register Bit Definitions 
BIT 
row [ewe [meno 
PM1 Define External PM1 Pin 
If = 1: PM1 pin indicates counter overflows 
If = 0: PM1 pin indicates counter events 
24 CT1 Counter #1 Counter Type 
If = 1: Count clock cycles 
If = 0: Count events (reset state). 
23 CP13 Counter #1 CPL3 Enable 
If = 1: Fnable counting when CPL=3. 
If = 0: Disable counting when CPL=3. (reset state) 
22 CP12 Counter #1 CPL Less Than 3 Enable 


If = 1: Enable counting when CPL < 3. 
If = 0: Disable counting when CPL < 3. (reset state) 


26,21-16 | TC1(5-0) | Counter #1 Event Type 
Reset state = 0 
Define External PMO Pin 
If = 1: PMO pin indicates counter overflows 
If = 0: PMO pin indicates counter events 
CTO Counter #0 Counter Type 
If = 1: Count clock cycles 
If =0: Count events (eset state). 
7 CP03 Counter #0 CPL3 Enable 
If = 1: Enable counting when CPL=3. 
If = 0: Disable counting when CPL-3. (reset state) 
CP02 Counter #0 CPL Less Than 3 Enable 


If = 1: Enable counting when CPL < 3. 
If = 0: Disable counting when CPL < 3. (reset state) 


10, 5-0 TCO(5-0) | Counter #0 Event Type 
Reset state = 0 


Note: Bits 10 - 15 are reserved. 
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2.8.2 Event Type and Description 


The events that can be counted by the performance monitoring counters are listed in Table 2-24. 
Fach of the 127 event types is assigned an event number. 


A particular event number to be counted is placed in one of the MSR(11) Event Type fields. 
There is a separate field for counter #0 and #1. 


The events are divided into two groups. The occurrence type events and duration type events. 
The occurrence type events, such as hardware interrupts, are counted as single events. The du- 
ration type events such as “clock while bus cycles are in progress” count the number of clock cy- 
cles that occur during the event. 


During occurrence type events, the PM pins are configured to indicate the counter has incre- 
mented The PM pins will then assert every time the counter increments in regards to an occur- 
rence event. Under the same PM control, for a duration event the PM pin will stay asserted for 
the duration of the event. 


Table 2-24. Event Type Register 


| NUMBER |COUNTER 0 [COUNTER 1 DESCRIPTION 


02h 
06h 
om | yes | yes [Cathe Missos Insrason Fach | Occurence _ 
12h Oecumenee 
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Table 2-24. Event Type Register (Continued) 


NUMBER [COUNTER O[COUNTER7[ —~—~—~=~=SdD SCRIPTION, ——=SO=~“~*~™SCSC*SPSC* 
ah 
es 


1Ah 


20h [yes | ves [Reed CC(‘“‘LCONCOC#*” 
path [yes | yes [Reed SSC~“~*S*~*~*—*—‘“‘—*S*~sé~sSC“‘C‘C‘*”' 
/ 2th [yes | no _[MMX Instruction ExecutedinXpipe ____—_—_—_—_—~([ Occurence | 
-2Dh__| ys | mo _[BMMSInstruction Heecuted —=—==SSS~SCSCSC«iccumence | 
[3mm no | ves [Reed 
2m | _ys | mo _|Saurating MMXinsiructions Brocuted_____—_—_—_—_—([ Occunence | 
(80h yes | mo [Remed$ Ss C“~*~*SC‘C#*S 
-3th__|_yes__| no [MK InstructionDaaRenis —=~=SCSCSC*drccuence | 
[ah [yes | mo [Reed COC=“—*~*~dSC‘CSC@’ 
| 33h | no | | yes 
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Table 2-24. Event Type Register (Continued) 


| NUMBER |COUNTER 0 [COUNTER 1 DESCRIPTION 


Reeved —SSC~—~sSCSSTCCC*d 
[ath [yes [0 [Retums iced ncomesty —________Deearenes 
- 37h | no | yes Return Predicted (Comeetiy and Theoret) [Occurence _ 
38h [yes | no | MMK instruction Miliply UnitInieriock [Duration 
-— 38h | _no | __yes__ | MODV7MOV@ Store Stall De to Previous Operdion | Duration 
a a TTT 
-— 90h | no | yes _[RSBOverfows —=S~*~SC~S*«* ec 
[aah [yes [no [BIBFale Fries ——SCSCS~*~S*~S*«~ ec 


3Bh yes Number of Clock Stalled Due to Full Write Buffers While | Duration 
Executing 


- am | no [yee | all on MII Tatrucion Wie to Hor Mine [Duration 
[36-3 [ ye | se Rene Pn — 
40h | yer | yes [IB TUB Mises(CodeorDaia) | Occurence 
[ath [yes [yes (TX THBDaia Miss———SSSSCSC~*~S~S~S*«~d ec 

@h 
[ath [yes | yes [Reed SSC—C—sSCSSC 
aan [ys [ys fisrons Dada fam 

[vee | 
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4.0 ELECTRICAL 
SPECIFICATIONS 
4.1 Electrical Connections 


This section provides information on electrical 
connections, absolute maximum ratings, 
recommended operating conditions, DC char- 
acteristics, and AC characteristics. All voltage 
values in Electrical Specifications are measured 
with respect to Veg unless otherwise noted. 


The M II CPU operates using two power supply 
voltages—one for the I/O (3.3 V) and one for 
the core (2.9 V). 


4.1.1 Power and Ground 


Connections and 
Decoupling 


Testing and operating the M II CPU requires 
the use of standard high frequency techniques 
to reduce parasitic effects. The high clock 
frequencies used in the M II CPU and its output 
buffer circuits can cause transient power surges 
when several output buffers switch output 
levels simultaneously. These effects can be 
minimized by filtering the DC power leads with 
low-inductance decoupling capacitors, using 
low impedance wiring, and by utilizing all of 
the Veg and GND pins. The M II CPU 
contains 296 pins with 25 pins connected to 
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Vecg (2.9 volts), 28 pins connected to Vec3 
(3.3 volts), and 53 pins connected to Vgg 
(ground). 


4.1.2 Pull-Up/ Pull-Down 


Resistors 


Table 4-1 lists the input pins that are internally 
connected to pull-up and pull-down resistors. 
The pull-up resistors are connected to Voc and 
the pull-down resistors are connected to Veg, When 
unused, these inputs do not require connec- 
tion to external pull-up or pull-down resis- 
tors. The SUSP# pin is unique in that it is 
connected to a pull-up resistor only when 
SUSP# is not asserted. 


Table 4-1. Pins Connected to Internal 
Pull-Up and Pull-Down Resistors 


SIGNAL [PINNO. [RESISTOR 
20-10 pull-up 
20-K2 pull-down (ws tex) 


Reserved 
TOK 
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4.1.3 Unused Input Pins 


All inputs not used by the system designer and 
not listed in Table 4-1 should be connected 
either to ground or to Vog. Connect active-- 
high inputs to ground through a 10 ka (+ 10%) 
pull-down resistor and active-low inputs to 
Voc through a 10 kQ (+ 10%) pull-up resistor 
to prevent possible spurious operation. 


4.1.4 NC and Reserved Pins 


Fins designated NC have no internal connec- 
tions. Pins designated RESV or RESERVED 
should be left disconnected. Connecting a 
reserved pin to a pull-up resistor, pull-down 
resistor, or an active signal could cause unex- 
pected results and possible circuit malfunc- 
tions. 


Table 4-2. 


4.2 Absolute Maximum 


Ratings 


The following table lists absolute maximum 
ratings for the M II CPU processors. Stresses 
beyond those listed under Table 4-2 limits may 
cause permanent damage to the device. These 
are stress ratings only and do not imply that 
operation under any conditions other than 
those listed under “Recommended Operating 
Conditions” Table 4-3 (Page 4-3) is possible. 
Exposure to conditions beyond Table 4-2 may 
(1) reduce device reliability and (2) result in 
premature failure even when there is no imme- 
diately apparent sign of failure. Prolonged 
exposure to conditions at or near the absolute 
Maximum ratings may also result in reduced 
useful life and reliability. 


Absolute Maximum Ratings 


Panaweren [win [MAX [UNTS[ NOTES 
[Sone Tempentwe || a0 |S 
Sel Wkge Vos | os | ao [vl 


Volage On Any Fin Nov io exceed Vood ma 
input Clamp Cument ig, | | 10 | _mA_| Rower pled 
Output Clamp Current, Ipx a Fower Applied 


PRELIM INARY 


Recommended Operating Conditions 4 


4.3 Recommended Operating Conditions 


Table 4-3 presents the recommended operating conditions for the M II CPU device. 


Table 4-3. Recommended Operating Conditions 


[__Panaweren SSN | MAX [unis [NOTES 
[5 Operding aw Temperine | 0 | 70_| °C _| Power dpnliad | 
ees Supply Whee aay) id sass [aa VP 
Tiers SupplyWiige@oW | 2a +f a0 | v |_| 
Tnx High-Level put Woke exo 200 | ass_[ v |_| 


Vinx CEKHigh Love pst Whee +20 [ss | v [id 
iz, LowtnelieputVotae | 0a fos | v |_| 
lou Hih-Level Output Camere [| 40 | mA | Woon | 
fo Tow-Lavel Output Game ee 
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4.4 DC Characteristics 


Table 4-4. DC Characteristics (at Recommended Operating Conditions) 1 of 2 


7 a a 


Vou High-Level Output Voltage =-1 [Ion=-1mA 


Input Leakage Current =e a < eae < Voc3 
For all pins (except those 
listed in Table 4-1). 


In; ~=—s Input Leakage Current WA | Vay= 2.4 V 
For all pins with internal Note 1 


pull-downs. 


Tnput leakage Current Vy = 0.45 V 
For all pins with internal pull-ups. 


[Gx _Tnput Caparincs | 

Coxe Output Capacitance || 20 ie J 
[ip WO Capatancs P88 
[Cos CK Capecitnce ||| 8 Re 


*Note: Not 100% tested 
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Table 4-5. DC Characteristics (at Recommended ae i 2 of 2 


log PE a 1,2 
75/225 MHz (MII -300) 
66/233 MHz (MII -300) 
83/250 MHz (MII -333) 


Active Iog Notes 1, 2, 3 
75/225 MHz (MII -300) 
66/233 MHz (M II -300) 
83/250 MHz (MII -333) 


Togs Standby Ig, Notes 1, 2, 4 
0 MHz ee re 50.0 


1. These values should be used for power supply es Maximum Icc is determined oe worst-case instruction sequences and 
functions at maximum Vcc. 

2. Frequency (MHz) ratings refer to the internal clock frequency. 

3. All inputs at 0.4 or Vocg - 0.4 (CMOS levels). All inputs held static except clock and all outputs unloaded (static Iptyp = 0 mA). 

4. All inputs at 0.4 or Vocg - 0.4 (CMOS levels). All inputs held static and all outputs unloaded (static Inyp = 0 mA). 


Table 4-6. Power Dissipation 


PARAMETER ae] | uinias | nores | 


a ca RO 


Active Power Dissipation Cae 1 
75/225 MHz (M II -300) 12.5 20.7 
66/233 MHz (M II -300) 13.0 21.6 
83/250 MHz (MII -333) 14.0 23.3 


Suspend Mode Power Dissipation Notes 1, 2 
75/225 MHz (MII -300) 0.150 Ww 
66/233 MHz (MII -300) 0.152 
83/250 MHz (MII -333) 0.157 


Standby Mode Power Dissipation Notes 1, 3 
0 MHz (Suspended/CLK Stopped) 0.070 


Notes: 1. Systems must be designed to thermally dissipate the maximum active power dissipation. Maximum power is determined using the 
worst-case instruction sequences and functions with Vec2 = 2.9 V and Vec3 = 3.3 V. 
2. All inputs at 0.4 or Vogg - 0.4 (CMOS levels). All inputs held static except clock and all outputs unloaded (static Igyyp = 0 mA). 


3. All inputs at 0.4 or Vocg - 0.4 (CMOS levels). All inputs held static and all outputs unloaded (static Inu = 0 mA). 
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4.5 AC Characteristics 


Tables 4-7 through 4-12 (Pages 4-8 through 
4-11) list the AC characteristics including 
output delays, input setup requirements, input 
hold requirements and output float delays. 
These measurements are based on the 
measurement points identified in Figure 4-1 
(Page 4-7) and Figure 4-2 (Page 4-8). The rising 
clock edge reference level Vppp and other 


reference levels are shown in Table 4-7. Input 
or output signals must cross these levels during 
testing. 

HKigure 4-1 shows output delay (A and B) and 
input setup and hold times (C and D). Input 
setup and hold times (C and D) are specified 
minimums, defining the smallest acceptable 


sampling window a synchronous input signal 
must be stable for correct operation. 


The JIAG AC timing is shown in Table 4-13 


(Page 13) supported by Figures 4-6 (Page 
4-13) though 4-8 (Page 4-14). 
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“Ox mn | 


[ff ts VT 


A- Maximum Output Delay Specification 

B- Minimum Output Delay Specification 

C- Minimum Input Setup Specification 

D - Minimum Input Hold Specification 1709406 


Figure 4-1. Drive Level and Measurement Points for Switching Characteristics 


Table 4-7. Drive Level and Measurement 
Points for Switching Characteristics 


VOLTAGE 
SYM BOL (Volts) 


Note: Refer to Figure 4-1. 
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Table 4-8. Clock Specifications 
Toagr = 0°C to 70°C, See Figure 4-2 


60-MHz BUS 66-MHz BUS 75-MHz BUS 83-MHz BUS 
PARAMETER UNITS 


| MIN | MAX | MIN | | MAX [ MIN | MAX | 
a 
7 [| 150 es 
50 
ee Erm 40 | 
as oa 015 | 15 | 015 | 15 | ms 
ois | 45 [015 | 15 | ms 


| Max | MIN | 
ee [eo 
al [| 50 |_| 250 | pe | 
a 
[40] 
v6 [25 [015 
rig [aiktastine [ors | 15} os [15 


1740502 


Figure 4-2. CLK Timing and Measurement Points 
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Table 4-9. Output Valid Delays 
C= 50 pE Tease = 0°C to 70°C, See Figure 4-3 


Si amr[w [oeemr-e | 
PARAMETER UNITS 
— eu 51 


eae ape aera ayer 

LOCK#, PCD, PWT; SCYC, 

SMIACT#, W/R# 

re[AD—SSCS~—rSLO | FO | BO | GO | Ao | Go | 10 
ee re oe es Tec 
ear ee gest 
a [AP SS~w | 8 tO] 85 | of 85 | 10 75 | ns 
Teeny ——SSSC*d CO | 80 | 10] Gs | 10 | 68 | 10 | 60 | ns 
rib [aM ———SSSCS~s aa | 60 | ta | Go | aa] 60] 11] 55 | ns 
[ria weg a0 [80 [a0 80 [0 | 30 [20 [70 [oo | 
a 
rr [sume 0 eo to eo tof 80] 20 70 Tos J 


1740900 


Figure 4-3. Output Valid Delay Timing 


PRELIM INARY 4-9 


ae 
Gyrix AC Characteristics 
A 


advancing the Standards 


Table 4-10. Output Float Delays 
Cy, = 50 pE Tease = O°C to 70°C, See Figure 4-5 


Fann qc BUS ce ira BUS|7 aN ax BUS ice MHz BUS 
PARAMETER UNITS ITS 


api «0 | 0 [0 [00 || 
T17|D6s-Do, DPEDPOWW) | [100 [| 1o0| | 100[ | 100| ns 


T15 - T17 


174 1000 


Figure 4-4. Output Float Delay Timing 
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Table 4-11. Input Setup Times 
Tease = 0°C to 70°C, See Figure 4-5 


60-MHz | 66-MHz | 75-MHz | 83-MHz 
SYM BOL PARAMETER BUS BUS BUS BUS UNITS 


Table 4-12. Input Hold Times 
Tease = O°C to 70°C, See Figure 4-5 


60-MHz 66-MHz 75-MHz 83-MHz 
SYM BOL PARAM ETER =a a] a rin UNITS 


a OR S|. t0 fn 
ose [aNOID.EORY ant 


T3ib 
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Figure 4-5. Input Setup and Hold Timing 
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Table 4-13. JTAG AC Specifications 


ALL BUS FREQUENCIES 
SYM BOL PARAM ETER UNITS 


Sa 0000 
[ta [T0KHigh Time ——SSCSC~—~—~—SC—~—SCiSCSSSCS*~iCiae iC 
/—t88_[TCKow Time ——S~—<C~*~é‘SC‘ CSSCS*~“~*‘“TCe:~«*dt=C( CY 
[139 [T0KRaeTime ——SC~—~—~—sC—~‘iSSCSC“‘“‘dESCCd Cr | 
[M0 [Kame ——SCS—~—~—SsSSSSSCSC~sSC<“SC*dSCie iY 
[M3 [ TO Moat Dewy ——SSSCSC~idSC“‘CSNC*PSOUOC#O@BSCSC*dSSCms_—*|—_ 
/ Td | Non-test Outputs MloatDeay «| ——S«dY~Si id] |_| 
5 [TRS Pike With ——SSCSC—~—~idCSC“‘ OU C*d Ce (| 
746 [TDL TMS Sehp Time ——SCS~dCSC‘“‘(_OCPSSCSC~*~iSCiae «| 
/—T47__[ Non-testInpuis Setup Time ‘| _20~+| +m | 47_| 
- Ms [TDL TWMSHold Time ———SCS~idCSC‘iS*dPSSCSCS~dCms | 
on- pu me ns 
79 Non-text Inputs Holme _——~«d| tsi SSSCSC~dCms | 


1741102 


Figure 4-6. TCK Timing and Measurement Points 
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mn mm 


OUTPUT 
SIGNALS 


INPUT TAAVAY VYVVVYVV 
SIGNALS — {XXX MAAK 


1740400 


Figure 4-7. JTAG Test Timings 


1741200 


Figure 4-8. Test Reset Timing 
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5.0 MECHANICAL SPECIFICATIONS 
51 296-Pin SPGA Package 


The pin assignments for the M IT CPU in a 296-pin SPGA package are shown in Figure 5-1. 
The pins are listed by signal name in Table 5-1(Page 5-3) and by pin number in Table 5-2 (Page 
5-4). Dimensions are shown in Figure 5-2 (Page 5-5) and Table 5-3 (Page 5-6). 


37 36 BH BM 33 22 31 3 WO BW HH A BL A HD 19 18 17 1 15 14 13:12 1110 9 8 7 6 5 4 3 2 


fe] fe) fe) fe) ° fe) fe} fe) ° fe} ° ° ° 
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Figure 5-1. 296-Pin SPGA Package Pin Assignments 
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Figure 5-2. 296-Pin SPGA Package Pin Assignments (Bottom View) 
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Table 5-1. 296-Pin SPGA Package Signal Names Sorted by Pin Number 
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Table 5-2. 296-Pin SPGA Package Signal Names Sorted by Signal Names 
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Figure 5-3. 296-Pin SPGA Package A 
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Table 5-3. 296-Pin SPGA Package A 


MILLIM ETERS = ae ES 
Tavwao: | CeeRIETON oF wuiwerens [we 


M EASU RM ENT 


Fe apatites te ep 

spreader 

le ee eee 
package 


[—B_[Pndimeir ——=S=~=~“~*‘dCiBSY CY] 00m 
/_D_ [Overall package dimension | _4928_[ aot 19% | _1965_| 


D2 Vert. and hoz. heat spreader, 31.37 32.13 Sq. 1.235 1.265 
measured edge to edge. 

D3 Top metalization vert. and hoz., 33.43 34.42 1.316 1.355 
measured edge to edge 


Di [op mealizaion to topedge | 749 | 671 0m5 | 0364 
"Bi [Frrowolumm inearpin ping | 241 | 267 | 0.095 | 0.105 
/ 1B _[Rextroweohumn near pin spacmg | Lid [140 | 0.085 | 0.055 
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Figure 5-4. 296-Pin “Flip-Chip C4” SPGA Package B 
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Table 5-4. 296-Pin “ Flip-Chip C4” SPGA Package B Dimensions 


M ILLIM ETERS 
SYMBOL 


| MIN” | Max [| MIN” [ Max | 
[Bs | _oas_|_os1__[ or” | aoa _| 
>| 928 |91__[ 1940 | 1965 _| 


| 2a | 267 | 0005 | o105_| 
ei [ 10 [005 | 0055 | 
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Thermal Resistances 


Three thermal resistances can be used to idealize the heat flow from the junction of the M II CPU 
to ambient: 


= thermal resistance from junction to case in °C/W 
8cg = thermal resistance from case to heatsink in °C/W, 
Oga = thermal resistance from heatsink to ambient in °C/W, 
8c = 9cg + Oga, thermal resistance from case to ambient in °C/W. 
To =Ta + P* Oca (where T, = ambient temperature and P= power applied to the CPU). 


To maintain the case temperature under 70°C during operation 8c, can be reduced by a heat- 
sink/fan combination. (The heatsink/fan decreases 8c. by a factor of three compared to using a 
heatsink alone.) The required 8c, to maintain 70°C is shown in Table 5-4. The designer should 
ensure that adequate air flow is maintained to control the ambient temperature (T,). 


Table 5-3. Required Oc, to Maintain 70°C Case Temperature 


cm [40°C | 
a0 [7 | se [om [om [im [ie | 
[ae6 [3128 [220 [ie ie [87 | 
[a8 [ear [ie [ie [i | | 


200 [0 aor [aa [ae 5 a | 


*Note: Power based on Max Active Power values from Table 4-6, Page 4-5. 
Refer to the Cyrix Application AP105 titled “Thermal Design Considerations” for more information. 


Atypical 9; value for the M II 296-pin PGA-package value is 0.5 °C/W. 
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Appendix 


Ordering Information 


MIL - 300 GP 


Device Name ee | 


Mil 


Performance 


Package Type 
G = PGA Package 


Temperature Range 
P= Commercial 


1740002 


Note: For further information concerning Performance Ratings, visit our website at www.cyrix.com. 
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The Cyrix M II CPU part numbers are listed below. 


Cyrix M II™ Part Numbers 


FREQUENCY 
CLOCK 


MH 
PART NUMBER MUPTIPLIER (M Hz) 
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‘1+4’ Burst Read Cycle 
A 


AC Characteristics 

Address Bus Signals 

Address Parity Signals 

Address Region Registers (ARRx) 
Address Space 

Architecture Overview 


Back-Off Timing 

Base Field, Instruction Format 
Branch Control 

Burst Cycle Address Sequence 
Burst Write Cycles 

Bus Arbitration 

Bus Arbitration 

Bus Cycle Control Signals 

Bus Cycle Definition 

Bus Cycle Types Table 

Bus Cycles, Non-pipelined 
Bus Hold, Signal States During 
Bus Interface 

Bus Interface Unit 

Bus State Definition 

Bus State Diagram for M II 
Bus Timing 

Cc 


Cache Coherency Signals 

Cache Control Signals 

Cache Control Timing 

Cache Disable, Overall (CRO-14) 
Cache Disable by Region 

Cache Inquiry Cycles 

Cache Inquiry Cycles, SMM Mode 


INDEX 


Cache Organization 

Cache Units 

Caches, Memory 

CCRO Rit Definitions 

CCR1 Bit Definitions 

CCR2 Bit Definitions 

CCR3 Bit Definitions 

CCR4 Bit Definitions 

CCR5 Bit Definitions 

CCR6 Bit Definitions 

Clock Control Sgnals 

Clock Count for CPU Instructions 
Clock Count for FPU Instructions 
Clock Count for MMX Instructions 
Clock Specifications 
Configuration Control Registers 
Control Registers 

Counter Event Control Register 
CPUID Instruction 

Cyrix Enhanced SMM Mode 


D 


Data Bus Signals 
Data Bypassing 
Data Forwarding 
Data Parity Signals 
DC Characteristics 
Debug Register 
Descriptors 
Differences Between. 
M1 and 6x86 Processors 


E 

Hectrical Specifications 
Error Codes 

Event Type Register 
EWBE# Timing 
Exceptions 

Exceptions in Real Mode 
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F 

Fags Register 

HMoating Point Unit 

FPU Error Interface 

FPU Error Interface Signals 
FPU Operations 
Functional Blocks 


G 


Gate Descriptors 
Gates, Protection Level Transfer 


V/O Address Space 

Index Field, Instruction Format 
Initialization and Protected Mode 
Initialization of the CPU 

Input Hold Times 

Input Setup Times 

Inquiry Cycles Using AHOLD 
Inquiry Cycles Using BOFF# 
Inquiry Cycles Using HOLD/HLDA 
Instruction Fields, General 
Instruction Line Cache 
Instruction Pointer Register 
Instruction Set Overview 
Instruction Set Summary 
Instruction Set Tables 

Instruction Set Tables Assumptions 
Integer Unit 

Interrupt Acknowledge Cycles 
Interrupt and Exception Priorities 
Interrupt Control Signals 
Interrupt Vectors 

Interrupts and Exceptions 


J 


JTAG AC Specifications 
JTAG Interface 


L 


Lock Prefix 2-3 
M 

Maximum Ratings, Absolute 4-2 
Mechanical Specifications 5-1 
Memory Addressing 2-50 
Memory Addressing Methods 2-48 
Memory Management Unit 1-16 
MESI States, Unified Cache 2-57 
MMX Operations 2-89 
mod and vm Fields, Inst. Format 6-6 
Mode State Diagram 2-81 
Model Specific Registers 2-38 
a 
NC and Reserved Pins 4-2 
Non-pipelined Burst Read Cycles 3-30 
Non-pipelined Bus Cycles 3-27 
Oo 

Offset Mechanism 2-49 
Opcode Field, Instruction Format 6-4 
Out-of-order Processing 1-5 
Output Moat Delays 4-10 
Output Valid Delays 4-9 
P 

Package, Mechanical Drawing 5-5 
Paging Mechanisms (Detail) 2-52 
Performance Monitoring 2-38 


Performance Monitoring Event Type 2-41 
Pin Diagram, 296-Pin SPGA Package 5-1 
Pin List, Sorted by Pin Number 5-3 
Pin List, Sorted by Signal Name 5-4 


Pipeline Stages 1-5 
Pipelined Back-to-Back R/W Cycles 3-38 
Pipelined Bus Cycles 3-36 
Power and Ground Connections 4-1 
Power Dissipation 4-5 


Power Management Interface Signals 3-19 
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Power Management Interface Timing 3-60 SMM Operation 2-76 
Prefix Field, Instruction Format 6-3 Speculative Execution 1-14 
Privilege Level, Requested 2-8 ss Field, Instruction Format 6-9 
Privilege Levels 2-82 Stopping the Input Clock 3-62 
Programming Interface 2-1 Suspend Mode Signal States Table 3-21 
Protected Mode Address Calculation 2-50 Suspend Mode, HALT Initiated 3-61 
Protection, Segment and Page 2-82 System Management Mode(SMM) = 2-70 
Pull-Up and Pull-Down Resistors 4-1 T 

i ——_——_ Task Register 2-21 
RAW Dependency Example 1-10 Test Registers 2-46 
Recommended Operating Conditions 4-3 Thermal Characteristics 5-7 
reg Field, Instruction Format 6-7 Time Stamp Counter 2-38 
Region Control Registers (RCRx) 2-36 Timing, Bus 3-23 
Register Renaming 1-6 Translation Lookaside Buffer 2-52 
Register Sets 2-4 Translation Lookaside Buffer Testing 2-54 
Registers, Control 2-13 U 

Registers, General Purpose 2-5 

Registers, M Il Configuration 2-24 Unified Cache 1-14 
Registers, System Set 2-11 Unified Cache Testing 2-58 
Requested Privilege Level 2-8 V 

Reset Control Signals 3-7 

5 Ww — 
Scratchpad Memory Locking 2-61 WAR Dependency Example 1-7 
Segment Registers 2-7 WAW Dependency Example 1-8 
Selector Mechanism aa Weak Locking 2-37 
Shutdown and Halt 2-80 Write Through 237 
Signal Description Table 3-2 

Signal Groups 3-1 

SL-Compatible SMM Mode 2-78 

SMHR Register 2-74 

SMI# Interrupt Timing 3-40 

SMM Instructions 2-75 

SMM Memory Space 2-71 

SMM Memory Space Header 2-72 
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